Document Image Segmentation and Compression

نویسندگان

Hui Cheng

Charles A. Bouman

Jan P. Allebach

Bradley J. Lucier

Steve J. Harrington

Faouzi Kossentini

چکیده

Cheng, Hui, Ph.D., Purdue University, August, 1999. Document Image Segmentation and Compression. Major Professor: Charles A. Bouman. In the first part of this research, we propose an image segmentation algorithm called the trainable sequential MAP (TSMAP) algorithm. The TSMAP algorithm is based on a multiscale Bayesian approach. It has a novel multiscale context model which can capture complex aspects of both local and global contextual behavior. In addition, its image model uses local texture features extracted via a wavelet decomposition, and the textural information at various scales is captured by a hidden Markov model. The parameters which describe the characteristics of typical images are extracted from a database of training images and their accurate segmentations. Once the training procedure is performed, scanned documents may be segmented using a fine-to-coarse-to-fine procedure that is computationally efficient. In the second part of this research, we introduce a multilayer compression algorithm for document images. This compression algorithm first segments a scanned document image into different classes, then compresses each class using an algorithm specifically designed for that class. We also propose a rate-distortion optimized segmentation (RDOS) algorithm developed for document compression. Compared with the TSMAP algorithm, the RDOS algorithm can often result in a better rate-distortion trade-off, and produce more robust segmentations than TSMAP by eliminating those misclassifications which can cause severe artifacts. Experimental results show that, at similar bit rates, the multilayer compression algorithm using RDOS can achieve a much higher subjective quality than well-known coders such as DjVu, SPIHT, and JPEG.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Persian Printed Document Analysis and Page Segmentation

This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifyi...

متن کامل

Document Analysis And Classification Based On Passing Window

In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...

متن کامل

Markov Random Field Model Based Text Segmentation and Image Post Processing of Complex Scanned Documents

Haneda, Eri Ph.D., Purdue University, May 2011. Markov Random Field Model Based Text Segmentation and Image Post Processing of Complex Scanned Documents. Major Professor: Charles A. Bouman. In this dissertation, two image processing studies will be presented. The first study is segmentation for MRC document compression using an MRF model, and the second study is an automatic contrast enhancemen...

متن کامل

Document compression using rate-distortion optimized segmentation

Effective document compression algorithms require that scanned document images be first segmented into regions such as text, pictures, and background. In this paper, we present a multilayer compression algorithm for document images. This compression algorithm first segments a scanned document image into different classes, then compresses each class using an algorithm specifically designed for t...

متن کامل

Document Image Segmentation for Document Compression

متن کامل

Multilayer Document Compression Algorithm

In this paper, we propose a multilayer document compression algorithm. This algorithm first segments a scanned document image into different classes such as text, images and background, then compresses each class using an algorithm specifically designed for that class. Two algorithms are investigated for segmenting documents: a general purpose image segmentation algorithm called the trainable s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1999

Document Image Segmentation and Compression

نویسندگان

چکیده

منابع مشابه

Persian Printed Document Analysis and Page Segmentation

Document Analysis And Classification Based On Passing Window

Markov Random Field Model Based Text Segmentation and Image Post Processing of Complex Scanned Documents

Document compression using rate-distortion optimized segmentation

Document Image Segmentation for Document Compression

Multilayer Document Compression Algorithm

عنوان ژورنال:

اشتراک گذاری